PlantTFDB
Plant Transcription Factor Database
v4.0
Transcription Factor Information
Basic Information
TF ID Csa03g031770.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HB-other
Protein Properties Length: 1777aa    MW: 199725 Da    PI: 4.9502
Description HB-other family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
Csa03g031770.1    genome   CSGP     View CDS
Signature Domain
No.   Domain     Score   E-value   Start   End   HMM Start   HMM End
1     Homeobox   60.4    2.8e-19   96      151   2           57
                     T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
        Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                      kR+  t+ qle+Le+++ +++yps+++r++L++kl+Lt+rq ++WF+ rR k+kk
  Csa03g031770.1  96 PKRQMKTPFQLETLEKVYSEEKYPSEATRADLSDKLNLTDRQLQMWFCHRRLKDKK 151
                     69****************************************************98 PP
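
The Start and End values reported for the Homeobox domain can be checked against the protein sequence. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). `n_terminal` holds the first 180 residues copied from the Sequence section, and `slice_region` is a hypothetical helper name introduced only for illustration.

```python
# First 180 residues of Csa03g031770.1, copied from the Sequence section.
n_terminal = (
    "MGRGAIKEEESEGKRKTWRWPLATLVVVFLAVAVSSRTASNVGFFFTDRNSCSCSLQMGS"
    "DEEEDQIRSVADVVAGSNNNKKKNKIDNSSSSSAKPKRQMKTPFQLETLEKVYSEEKYPS"
    "EATRADLSDKLNLTDRQLQMWFCHRRLKDKKDDQSQSKTPVKPAVPAAVRPPPPAFASSV"
)


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# Homeobox domain coordinates from the Signature Domain table.
homeobox = slice_region(n_terminal, 96, 151)
print(homeobox)
print(len(homeobox))
```

Under that assumption, the slice reproduces the 56-residue query line of the alignment shown above.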

Protein Features
3D Structure
Database          Entry ID           E-value    Start   End    InterPro ID   Description
Gene3D            G3DSA:1.10.10.60   8.1E-19    78      151    IPR009057     Homeodomain-like
SuperFamily       SSF46689           2.57E-16   87      152    IPR009057     Homeodomain-like
PROSITE profile   PS50071            16.536     92      152    IPR001356     Homeobox domain
SMART             SM00389            5.0E-18    94      156    IPR001356     Homeobox domain
Pfam              PF00046            7.6E-17    96      151    IPR001356     Homeobox domain
CDD               cd00086            1.51E-14   97      151    No hit        No description
PROSITE profile   PS50827            18.031     613     672    IPR018501     DDT domain
SMART             SM00571            4.6E-24    613     672    IPR018501     DDT domain
Pfam              PF02791            9.7E-18    614     669    IPR018501     DDT domain
Pfam              PF05066            9.4E-16    795     862    IPR007759     HB1/Asxl, restriction endonuclease HTH domain
Pfam              PF15612            8.1E-7     995     1036   IPR028942     WHIM1 domain
Pfam              PF15613            4.7E-13    1170    1242   IPR028941     WHIM2 domain
Gene Ontology
GO Term      GO Category          GO Description
GO:0006355   Biological Process   regulation of transcription, DNA-templated
GO:0003677   Molecular Function   DNA binding
Sequence
Protein Sequence    Length: 1777 aa
MGRGAIKEEE SEGKRKTWRW PLATLVVVFL AVAVSSRTAS NVGFFFTDRN SCSCSLQMGS  60
DEEEDQIRSV ADVVAGSNNN KKKNKIDNSS SSSAKPKRQM KTPFQLETLE KVYSEEKYPS  120
EATRADLSDK LNLTDRQLQM WFCHRRLKDK KDDQSQSKTP VKPAVPAAVR PPPPAFASSV  180
NDLPPARSVP EQDSGSGSDS GSGCSPYSDS RRNFASGSSS SRAELDEYET MVKPSYEPRL  240
SAMVRRAIVC IEAQLGEPLR DDGPILGMEF DPLPPGAFGS PIAMQKHLLH PYESKMYEPH  300
DVRPRRSQAA ARSFHEQQSL DDPSSFTPEM YGRYSENHAH GMDYEIARPR SSSFMHENGS  360
LPRSYGTPGY VSRNCSTSQQ DMPSPIVASA HRGDRFLMEK DSSVLGTEDP YMLSDGVHKS  420
NDVHRKGKIH DVRLGRGSET RENRGPKDLE KLEIQKKKNE ERMRKEMERN ERERRKEEER  480
LMRERIKEEE RLQREQRREM ERREKFLQRE NERAEKKKQK EEIRREKDAI RRKIAIEKAT  540
ARRIAKESMD LIEDEQLELM ELAAISKGLP SVLQLDHDTL QNLELYRDSL STFPPKGLQL  600
KMPFAISPWK DSDESVGNLL MVWRFLTSFS DVLDLWPFTL DEFIQAFHDY DSRLLGEIHV  660
TLLRSIIRDI EDVARTPFSG IGNNQYTTAN PEGGHPQIVE GAYAWGFDIR SWKKNLNPLT  720
WPEILRQLAL STGLGPRLKK KSSRFTHTGD KDEAKGCEDI ISTIRSGSAA ESAFALMREK  780
GLLAPRKSRH RLTPGTVKFA AFHVLSLEGS KGLTVLELAD KIQKSGLRDL TTSKTPEASI  840
SVALTRDVKL FERIAPSTYC VRAPYVKDPA DGEAILADAR KKIRAFESGL TGPEDVNDLE  900
RDEDFEIDID EDPEVDDLAT LASASKSADL DEANVLSGKG GDTMFCDVKA GVKSEIEKEF  960
SSPPPSSIKS IAPQHNERLK DTAVGCVDAM VDESNEGQSW IQGLTEGDYC HLSVEERLNA  1020
LVALVGIANE GNSIRAGLED RMEAANSLKK QMWAEAQLDN SCMRDVLKLD FQNLASSKTE  1080
STMGLPIIQS SNRERDNFGG DPSELLDEKK PLEVVSNDLQ KSTAERGLIN QEAIISQENC  1140
SFQQGYVSKR SRSQLKSYIG HKAEEVYPYR SLPVGQDRRH NRYWLFAASA SKSDPSSGLL  1200
FVELHDGKWL LIDSEEAFDT LVASLDMRGI RESHLRIMLQ KIEGSFKENA RKNMKLARNP  1260
FLKEKSVMNH SPTDSVSPSS AVSGSNSDSM ETSNSIRVEL GRNDTEKKSL SKRFHDFQRW  1320
MWTETYSSLP SCAKKYGKKR SELLATCALC VASYLSEYTH CTSCHQRLDM VDDSEILDSG  1380
LTVSPLPFGV RLLKPLLVFL EASIPDEALE SFWTEDKRKI WGFRLNASSS PEEALQVLTT  1440
LETAIKKEYL SSNFMSAKEL LGVGDADADD PGSVDVLPWI PKTVSAVALR LSELDASIIY  1500
VKPEKPDLIP EDETEQISLF PGDSLFKGKG PREQEDQDEV VPNLGNRRSN KRARVSLGSG  1560
SNKKVKRKKA QGGPNRFVVS QRNVAVDNNL MSMELNHQIP GRGKRTVRKR PERINEENDH  1620
LVNRMADIVR PKTQEVEEDE EEEEQTFRDI DEDWAAGETP REMDEDWANE TPNRMTPMQV  1680
DDESDNSVGV ESEDDDVDGQ FVDYSQRNKW GLDWNSNANE AAMEDEEEEE VVGVERVEGE  1740
DDAEISESSE DDDDVPANNA ANNYDRESEG YSSSDS*
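
For downstream use, the numbered block above usually has to be reduced to a plain residue string. The following is a minimal Python sketch, assuming that everything other than letters (position counters, spaces, the trailing `*`) is layout rather than sequence. `clean_sequence` is a hypothetical helper name, and the excerpt is the first two lines of the block above.

```python
import re


def clean_sequence(block: str) -> str:
    """Reduce a numbered sequence block to a plain residue string."""
    # Keep letters only; digits, whitespace and '*' are dropped.
    return re.sub(r"[^A-Za-z]", "", block)


# First two lines of the protein sequence block, copied verbatim.
excerpt = """
MGRGAIKEEE SEGKRKTWRW PLATLVVVFL AVAVSSRTAS NVGFFFTDRN SCSCSLQMGS  60
DEEEDQIRSV ADVVAGSNNN KKKNKIDNSS SSSAKPKRQM KTPFQLETLE KVYSEEKYPS  120
"""

seq = clean_sequence(excerpt)
print(len(seq))  # 120 for this two-line excerpt
```

Applied to the full block, the resulting length can be compared with the length stated in the section header.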
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     144     151   RRLKDKKD
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   AC010155   0.0       AC010155.3 Genomic sequence for Arabidopsis thaliana BAC F3M18 from chromosome I, complete sequence.
GenBank   CP002684   0.0       CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein
Source      Hit ID                             E-value   Description
Refseq      XP_010499372.1                     0.0       PREDICTED: uncharacterized protein LOC104776906
Swissprot   F4HY56                             0.0       RLT1_ARATH; Homeobox-DDT domain protein RLT1
TrEMBL      D7KCW8                             0.0       D7KCW8_ARALL; HB-1
STRING      fgenesh2_kg.1__3015__AT1G28420.1   0.0       (Arabidopsis lyrata)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM8347               22            36
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT1G28420.1   0.0       homeobox-1